oV 2 and Yersinia pastis) based on 3 mer words. The frequency
3-mer words demonstrated a significant difference between three
s.
3-mers pattern for MT042778 (M), AB889999 (A) and QANK01002681 (Q).
7.20 shows the correlation coefficients, where it can be seen that
lation coefficient between the 3-mer vector (ܠௌோௌି) of the
oV and the 3-mer vector (ܠௌோௌିିଶ) of the SARS-CoV-2
s was the greatest, being 0.868. The correlation coefficient
of ܠௌோௌି and the 3-mer vector (ܠ௦) of the Yersinia
as small, being 0.192. The correlation coefficient between
ିଶ and ܠ௦ was the least, being 0.079.
Correlation coefficients for 3-mer frequency of MT042778, AB889999 and
02681.
MT042778
AB889999
QANK01002681
MT042778
1.000
0.868
0.079
AB889999
0.868
1.000
0.192
QANK01002681
0.079
0.192
1.000
erarchical cluster model was constructed for this 3-mer data
d from three sequences. The model shows that MT042778 (M for
oV-2) and AB889999 (A for SARS-CoV) were merged first
ANK01002681 (Q for Yersinia pastis) was merged with the
f MT042778 and AB889999 with a larger distance. Figure 7.9